Optimized implementation of an MVC decoder

نویسندگان

  • Jochen Britz
  • Thorsten Herfet
  • Goran Petrovic
چکیده

3D video is getting more popular in various applications. For a 3D video experience, at least two different views of the same scene are necessary. Despite serious interests in consumer industries and open-source communities, the main challenge in designing a real-time 3D communication system is that by today several mandatory components required for such a system are generally not available. In particular, to the best of our knowledge, no MVC-compatible software decoder currently achieves real-time performance. In our work, we address the challenge of implementing an open-source decoder for multi-view video (MVV) representations. We focus on the design and implementation of a real-time decoder based on the FFmpeg framework, that is compliant with H.264/AVC Annex H (referred to as MVC). As such, we address and implement the missing components in the H.264 implementation of FFmpeg according to MVC. Namely, we first extend the parsing routines to be able to handle MVC-compliant bitstream by implementing support for new NAL unit types and parameter sets. In addition, we implement new buffers and enhanced structs to store MVC-dependent data, such as the structs for SPS and PPS and buffers for Subset SPS and inter-view reference lists. Second, we extend the decoding routines by extending DBP and modifying the reference picture handling. For that reason, we modify the existing code and implement additional functions, as required by MVC. Additionally, we investigate multi-threading capabilities and optimize the implementation to be able to decode selected views only. Finally, we add configuration options by extending the command line interface with additional parameters. We test our implementation on a commodity desktop computer and achieve decoding times of 18 ms per frame on average (for a single frame in all 8 views). This means, we achieve real-time performance for sequences with up to 50 frames per second. In addition to the implementation, we perform experiments on different MVV sequences to optimize the coding of multi-view sequences in terms of quality-complexity trade-offs; we perform this by varying the prediction schemes and quantization parameters as well as the scenes. Our findings are that there is dependence of coding on scene characteristics and prediction schemes. Finally, we analyze the impact of quantization on virtual view rendering and its relation to prediction schemes. We experience, that quantization has impact on virtual view rendering while applying of prediction schemes do not.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Systematic analysis of the decoding delay in multiview video

We present a framework for the analysis of the decoding delay in multiview video coding (MVC). We show that in real-time applications, an accurate estimation of the decoding delay is essential to achieve a minimum communication latency. As opposed to single-view codecs, the complexity of the multiview prediction structure and the parallel decoding of several views requires a systematic analysis...

متن کامل

Parallel implementation of Multi-view Video Decoder for Reduction in Power Consumption

Mobile platform based multi view video applications have gained significant attention due to increase in processing power of mobile processors. Significant performance improvements have been reported by using H.264/AVC based video encoding-decoding procedures. Multi-view coding (MVC) is an extension of H.264/AVC scheme employed for high performance compression of multi view videos. The increase...

متن کامل

Power Optimized Viterbi Decoder Implementation Throught Architectural Transforms

Viterbi is an algorithm for error correction in the transmission of messages. It requires coding and decoding stages in the sender and receiver, respectively. These type of algorithms are very useful for the transmission of a type of messages where some degree of error in the received message is acceptable, such as, voice and video. The coding allows some error detection and correction. In this...

متن کامل

The Emerging MVC Standard for 3D Video Services

Multiview video has gained a wide interest recently. The huge amount of data needed to be processed by multiview applications is a heavy burden for both transmission and decoding. The joint video team has recently devoted part of its effort to extend the widely deployed H.264/AVC standard to handle multiview video coding (MVC). The MVC extension of H.264/AVC includes a number of new techniques ...

متن کامل

Distributed Source Coding Application to Low-delay Free Viewpoint Switching in Multiview Video Compression

Multiview video coding (MVC) exploits the temporal and spatial redundancy between neighboring frames of the same view or that of adjacent views to achieve compression. Free viewpoint switching, however, poses challenges to MVC, as when a user is able to choose different playback paths it would become unclear to encoder which previously reconstructed frame would be available for decoding the cur...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013